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LISTING OF CLAIMS 

1 . (currently amended) A computer implemented method of generating context 
vectors representing information items for retrieval of the information items or records 
containing the information items, the method comprising: 

assigning a context vector to each of a plurality of information items; 
initializing the context vectors such that the context vectors are substantially 

orthogonal to each other in a vector space; 
determining proximal co-occurrences of the information items; aR4 
adjusting the context vectors based on the proximal co-occurrences of the 

information items, such that the information items e l e m ent s that frequently 

proxirnally co-occur have context vectors with similar orientations in the vector 

spaceO ; and 

using said adjusted context vectors for retrieving said information items or 
records containing said information items. 

2. (Original) The method of claim 1 , wherein initializating the context vectors 
further comprises: 

assigning vector components to the vectors using zero-mean, unit-variance 
Gaussian random number generation. 

3. (Previously presented) The method of claim 1 , wherein a target context vector 
is a context vector assigned to a target information item, and a neighbor context vector 
is a context vector assigned to an information item that proxirnally co-occurs with the 
target context vector, and wherein adjusting the context vectors comprises: 

for each target context vector to be adjusted: 

determining an error vector between a target context vector and each 

neighbor context vector; 
updating the target context vector as a function of the error vectors. 
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4. (Previously presented) The method of claim 3, wherein updating the target 
context vector as a function of the error vectors comprises: 

determining a correction vector from the error vectors, where the correction 
vector is: 

ws 

where: 

Ejj is the error vector Ey = Njj - Tj between the neighbor context vector 

Njj and a target context vector T j; 
WS is a window size containing the target context vector and the 

neighbor context vectors; and 
q is a proximity constraint; and 
updating the target context vector as 

^ EW =Tf LD + ^r _E Cf-M 

where: 

Y is a step size; 

Fj is the total number of occurrences of information item j; and 
M is a mean context vector for all unique context vectors. 

.5. (Previously presented) The method of claim 1 wherein a target context vector 
is a context vector assigned to a target information item, and a neighbor context vector 
is a context vector assigned to an information item that proximally co-occurs with the 
target context vector, and wherein adjusting the context vectors comprises: 

determining a weighted sum vector of neighbor context vectors of a target 
context vector; 

applying the weight sum vector to the target context vector. 
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6. (Previously presented) The method of claim 5, further comprising: 
determining the weighted sum vector according to the equation: 



t u 1 



where: 



W is the weighted sum vector; 

Njj is the neighbor context vector to target context vector T j; 

G(i) is a Gaussian weight for the neighbor context vector i; and 

Dj is the number of documents that contain target information item j; 



and 



applying the weighted sum vector to the target context vector according to the 
equation: 



where: 

jnew j S ^ e ^^3^3 target context vector; and 
T f OLD is the un-updated target context vector. 

7. (Previously presented) The method of claim 1 wherein a target context vector 
is a context vector assigned to a target information item t and a neighbor context vector 
is a context vector assigned to an information item that proximally co-occurs with the 
target context vector, and wherein adjusting the context vectors comprises: 

determining a weighted sum vector of neighbor context vectors of a target 



context vector according to the equation 




where: 
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W is the weighted sum vector; 

G(i) is a Gaussian weight for the neighbor context vector i; and 
Dj is the number of documents that contain target information item j; 
determining an error vector from the weighted sum vector and the target context 
vector: 



where 

E is the error vector; 
T is target context vector; 
determining a correction vector C from the error vectors of the neighbor context 
vectors: 

Cj= X E: 
applying the correct vector to the target vector: 

where: 

Tj NEW is the updated target context vector; and 
Tj 0LD is the un-updated target context vector. 
Y is a step size; and 

M is a mean context vector for all unique context vectors. 

8. (Original) The method of claim 1, wherein the vector space is defined by a plurality of 
axes, and the context vector include vector components corresponding to the respective 
axes, and wherein the axes individually do not have specific semantic associations. 

9. (Previously presented) The method of claim 1, wherein a target context vector 
is a context vector assigned to a target information-item, and a neighbor context vector 
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is a context vector assigned to an information item that proximally co-occurs with the 
target context vector, and wherein adjusting the context vectors comprises: 

adjusting the target context vector as a function of the relative importance of at 

least one of either the target information element or the neighbor context 

vector with respect to the plurality of information item. 

10. (Previously presented) The method of claim 1, wherein a target context 
vector is a context vector assigned to a target information item, and a neighbor context 
vector is a context vector assigned to an information item that proximally co-occurs with 
the target context vector, and wherein adjusting the context vectors comprises: 

adjusting the target context vector as a function of the frequency of occurrence of 
the target information item; the frequency of occurrence of the neighbor 
information item; a total number of records containing the target information 
item, and a total number of record containing the neighbor information item. 

1 1 . (Previously presented) The method of claim 1 , wherein a target context 
vector is a context vector assigned to a target information item, and a neighbor context 
vector is a context vector assigned to an information item that proximally co-occurs with 
the target context vector, and wherein adjusting the context vectors further comprises: 

adjusting the target context vector as a function of: 

a distance of the neighbor information item from the target information 
item, so that neighbor information items that are closer to the target 
information item cause the target context vector to be adjusted to 
be closer to the neighbor context vector; and 

a frequency of occurrence of the neighbor information item in records 
containing the information items, such that a neighbor information 
item that is less frequently occurring more strongly causes the 
target context vector to be adjusted to be closer to that neighbor 
context vector than a neighbor information item that is more 
frequently occurring. 
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12. (Previously presented) The method of claim 1, further comprising: 

for at least one record comprising a plurality of information items, determining a 
summary context vector for the record from the normalized sum of the context 
vectors of the information items that comprise the record. 

13. (Previously presented) The method of claim 12, further comprising: 
receiving a query comprising at least one information item; 

generating a query context vector from the information item that comprise the 
query; 

retrieving at least one record having a summary context vector with a orientation 
in the vector space similar to the orientation of the query context vector. 

14. (Original) The method of claim 13, wherein retrieving at least one record 
having a summary context vector with a orientation in the vector space similar to the 
orientation of the query context vector further comprising: 

a tree walk of a cluster tree, the cluster tree comprising a hierarchical plurality of 
nodes, each node having a cluster centroid vector, each cluster centroid 
vector associated with a cluster of one or more records, and derived from the 
one or more records contained in the cluster, the tree walk performed by 
iternatively selecting a node of the cluster tree that has a centroid cluster 
vector with a closest orientation in the vector space to the query context 
vector. 

15. (Original) The method of claim 1 , further comprising: 

clustering the context vectors into a plurality of clusters, each cluster having a 
centroid vector derived from the plurality of context vector contained in the 
cluster. 

16. (Previously presented) The method of claim 1, wherein: 
information items having similar meaning have context vectors with similar 

orientations in the vector space. 
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17. (Previously presented) The method of claim 1, further comprising: 
determining a similarity of meaning between a first information item and a second 

information item by performing a vector operation on the context vectors of 
the first and second information items. 

18. (Currently amended) A computer implemented method of generating vectors 
representing information items for retrieval of the information items, the method 
comprising: 

selecting a set of R information items in a database : 

determining for the selected set of information elements an RxR mutual co- 
occurrence matrix based on proximal co-occurences of the information items 
in a plurality of documents; 

applying Singular Value Decomposition to the mutual co-occurrence matrix to 
produce a set of first context vectors , th e first cont e xt v e ctors h a ving 
ori e ntations i n a D dimens i ona l voctor s pac e , wh e r e D«R ; and 

wherein each first context vector is uniquely associated with one of the selected 
information items, and wherein information items having similar meaning have 
respective first context vectors with similar orientations in the vector space[[.]]; 
and 

wherein said context vectors are used for retrieval of information items from said 
database. 

19. (Previously presented) The method of claim 13, wherein the mutual co- 
occurrence matrix comprises for each pair of selected information items, a normalized 
measure of the frequency of proximal co-occurrence of the pair of selected information 
items. 

20. (Previously presented) The method of claim 13, wherein the selected set of 
information items comprises a first selected set, the method further comprising: 

selecting a second set of information items different from the first selected set of. 
information items ; 

- 16- 
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associating each of the second set of information items with a second context 
vector; and 

updating the second context vectors at least once using the first context vectors, 
wherein the first context vectors are fixed during at least one update of the 
second context vectors. 

21 . (Currently amended) A computer implemented method of retrieving a record 
from a database containing a plurality of records, each record containing at least one 
information item having an associated context vector, the method comprising: 

for each of a plurality of information items, storing context vectors associated with 
the information item the context vectors having the properties that information 
items having similar meaning have context vectors with similar orientations in 
a vector space, and information items having dissimilar meanings have 
context vectors with dissimilar orientations in the vector space; 

for each of the plurality of records from said database containing said plurality of 
records , storing a summary context vector derived from context vectors 
respectively associated with information items that comprise the record; 

receiving a query; 

deriving at least one query information item from the query; 
generating a query context vector from the query information item; and 
selecting at least one record from said database having a summary context 

vector with orientation in the vector space that is similar to the orientation of 

the query context vector. 

22. (Original) The method of claim 21 , wherein selecting at least one record 
further comprises: 

for each of a plurality of records, determining a distance in the vector space 
between the query context vector and a summary context vector of a record; 
and 

selecting the record having a least distance between its summary context vector 
and the query context vector. 
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23. (Currently amended) A computer implemented method of providing a 
universal meaning space for human understandable information items, the method 
comprising: 

selecting a set of first information items in a corpus of records in a data storage ; 

creating a first set of context vectors based on proximal co-occurrences of the 
first information items in corpus of r e cords , each first context vector uniquely 
associated with one of the first information items, the context vectors having 
an orientation in a vector space, such that first information items having 
similar meaning have context vectors with similar orientations in the vector 
space; 

selecting a set of second information items in said data storage , the second 

information items being different from the first information items; 
selecting a subset of the first information items; 

for each first information item in the subset, selecting a corresponding second 

information item having a human understandable meaning substantially 

identical to the meaning of the first information item; 
for each of the selected second information items, associating the second 

information item with the context vector of the corresponding first information 

item; 

assigning a context vector to each non-selected second information item; and 
adjusting the context vectors of the non-selected second information items using 

the context vectors of the selected second information items ; and 
using anv of said context vectors for later retrieval of said first information items . 

24. (Previously presented) The method of claim 23, wherein: 

the first information items are words of a first human language, and the second 
information items are words of a second, different human language; and 

the subset of first information items and the corresponding second information 
items have substantially identical meaning. 
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25. (Previously presented) The method of claim 23, wherein: 

the first information items are symbolic representations of words of a human 
language encoded in a first data format, and the second information items are 
symbolic representations of non-text data encoded in a second data format 
different from the first format; and 

the subset of first information items and the corresponding second information 
items have substantially related meaning even though they have different 
data formats and different symbolic representations. 

26. (Previously presented) The method of claim 23, wherein adjusting the 
context vectors of the non-selected second information items further comprises: 

adjusting the context vectors of the non-selected second information items , such 
that non-selected second information items and selected second information 
items having similar meaning have context vectors with similar orientations in 
the vector space. 

27. (currently amended) A computer-implemented process method of generating 
a dictionary of information items for a database of records, each record including at 
least one information item, each information item associated with a context vector, each 
information item having a determinate proximity to other information items in a record, 
wherein a neighbor information item is an information item that occurs proximate a 
target information item in at least one record in the database, the method comprising: 

initializing the context vectors associated with information items in the dictionary 
of information items for said database of records , such that initial context 
vectors are substantially orthogonal to each other in a vector space; and 
for each information item being a target information item: 

selecting neighbor information items of the target information item in at 

least one record; and 
modifying the context vector of the target information item using the 
context vectors of each selected neighbor information items as a 
function of the proximity of each neighbor information item to the 
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target information item, and a co-importance of the target 
information item and the neighbor information ttem[[.]] 
wherein said dictionary is used for retrieving information items 
representing media during a search, 

28. The method of claim 27, further comprising: 

determining the co-importance according to the relative importance of the target 
information item and the relative importance of the neighbor information item. 

29. The method of claim 28, wherein determining the co-importance comprises: 
determining a first relative importance of the target information item, inversely 

according to the frequency of occurrence of the target information item in the 
records; 

determining a second relative importance of the neighbor information item 
inversely according to the frequency of occurrence of the neighbor 
information item in records; and 

determining the co-importance as a function of the first relative importance and 
the second relative importance. 

30. (Previously presented) The method of claim 29, further comprising: 
determining the. relative importance of an Information item by the equation: 

log<-~) 
ND . 

/,« e+<i-sxi- r*-) 

log(— — ) 
TND 

wherein: 

lj is the relative importance of information item J; 
B represents a predefined lower bound; 

NDj represents the number of records containing information item J; 
and 

TND represents the total, number of records. 
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31. (Previously, presented) The method of claim 27, wherein the co-importance 
of the target information item and the neighbor information item is determined using the 
equations: 

1) Ctn = ItIn 

wherein: 

Ctn is the co-importance of the target information item and the 

neighbor information item; 
It is the relative importance of the target information item; and 
In is the relative importance of the neighbor information item; and 

2) / ; - B + <1-fl)<1- 

log (— — ) 
TND 

wherein: 

lj is the relative importance of an information item J; . 
B represents a predefined lower bound; 

NDj represents the number of records containing information item J; 
and 

TND represents the total number of records. 
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32. (Previously presented) The method of claim 27, wherein a proximity constraint 
varies a magnitude of the modification to the context vector of the target information 
item as a function of both the frequency of occurrence of the target information item and 
the frequency of the occurrence of each neighbor information item in the records, so 
that the context vectors of information items that frequently proximally co-occur do not 
converge. 

33. (Currently amended) In a computer system including a storage device 
containing a plurality of records, each record containing a plurality of information items, 
a computer readable medium for configuring and controlling the computer system to 
generate a plurality of context vectors, the computer readable medium comprising: 
an initial context vector generation module, adapted to read and write to the 
storage device, which initializes to each of a plurality of selected information 
item items an initial context vector, such that the initial context vectors are 
substantially orthogonal to each other in a vector space, and which writes the 
initial context vectors to the storage device in association with respective 
information items; 

a vector training module, adapted to read and write to the storage device, for 
modifying the context vector of a selected information item, being a target 
information item, using the context vectors of neighbor information items that 
proximally co-occur with the target information item, as a function of the 
proximity of each neighbor information item to the target information item, and 
a co-importance of the target information item and the neighbor information 
item[[.]]i 

wherein said computer system is configured to use said modules in retrieving 
information items or records containing said information items, said 
information items representing media. 
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34. (Currently amended) A method of automatically indexing documents using a 
defined index of terms, the method comprising: 

providing an indexed collection of documents in a database , each document 
having at least one index term assigned to the document; 

providing a plurality terms, including the index terms, each term associated with a 
context vector, the context vector having the properties that that terms having 
similar meaning have context vectors with similar orientations in a vector 
space, terms having dissimilar meanings have context vectors with dissimilar 
orientations in the vector space, and terms which frequently proximally co- 
occur have context vectors with similar orientations in the vector space; and 

generating for each indexed document a context vector from the context vectors 
of selected terms that comprise the document; 

receiving a new document in said database to be indexed; 

generating a new context vector of the new document, the new context vector 
generated from the context vectors of selected terms that comprise the new 
document; 

selecting at least one indexed document having a context vector similar to the 
new context vector; 

assigning to the new document at least one index term assigned to a selected 

indexed document[[.]k 
such that said new document can later be retrieved from said database. 

35. (Original) The method of claim 34, wherein assigning to the new document 
at least one index term further comprises: 

for each selected indexed document, assigning a weight to each index term 
assigned to the indexed document, the weight proportional to the similarity 
between the new context vector and the context vector of the indexed 
document, such that the weight is higher where the context vectors are more 
similar; 

for each index term, generating an index term score as a function of a number of 
occurrences of the index term in each selected indexed document, and the 
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weight of the index term with respect to each selected indexed document, 
such that the index term score is higher as the number of occurrences of an 
index term increases; and 
assigning to the new document at least one of the index terms with a high index 
term score. 
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